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THE RATE AT WHICH MAN HAS CONTINUOUSLY BEEN ACCUMULATING USEFUL KNOWLEDGE 
ABOUT HIMSELF AND THE ENVIRONMENT HAS ACCELERATED-GRRATLY, PARTICULARLY. DURING 
THE LAST 100 YEARS, ‘THERE ARE MANY REASONS FOR THIS INCRRASE : "HERE ARE MORE _ 
SCIENTISTS TO-DAY THAN EVER BEFORE, SCIENTIFIC RESEARCH 18 RECEIVING 
SUBSTANTIAL GRANTS AND FINANCIAL SUPPORT FROM GOVERNMENT AND INDUSTRY, THE. COLD 
WAR HAS RESULTED IN A COMPETITIVE ATMOSPHERE IN WHICH NATIONS ARE ENDRAVOURING 
FOR MILITARY AND TECHNOLOGICAL SUPREMACY, ETC. 


BY THE MOST CONSERVATIVE ESTIMATES, EUROPE WAS PUBLISHING 1000 NEW BOOKS 
PER YEAR BEFORE 1500. BY 1950, FOUR AND A HALF CENTURIES BATER, ‘HE RATE 
HAS ACCELERATED SO SHARPLY THAT EUROPE WAS PUBLISHING 120,000 NEW BOOKS 
PER YEAR. BY THE MID-SEVENTIES, THE PUBLICATION OF BOOKS ON A WORLD SCALE, 
EUROPE INCLUDED, REACHED THE FIGURE OF APPROXIMATELY 1,500 NEW BOOKS PER DAY. 


“THERE ARE MORE SCIENTIFIC SOCIETIES IN EXISTENCE TODAY THAN EVER BEFORE, 
AND THESE SOCIETIES ARE HOLDING MORE MERTINGS, CONVENTIONS AND SEMINARS, 
AND PUBLISH MORE JOURNALS AND PROCEEDINGS THAN EVER BEFORE. {HE FIRST TWO 
SCIENTIFIC JOURNALS APPEARED 300 YEARS AGO. IN 1958, THE NUMBER OF RESPONSIBLE 
TECHNICAL JOURNALS APPROACHED 100,000: ‘THERE ARE NOW FOUR MILLION JOURNAL 
ARTICLES, 120,000 TECHNICAL BOOKS AND 100,000 TECHNICAL REPORTS BEING PUBLISHED 
ANNUALLY. (ON A WORLDWIDE BASIS APPROXIMATELY 60,000,000 PAGES EVERY YEARS) 
AND THIS RATE OF PUBLICATIONS 16 INCREASING EVERY YEAR. TO-DAY, FOR EXAMPLE, 
THE NUMBER OF SCIENTIFIC AND TECHNOLOGICAL JOURNALS AND ARTICLES IS DOUBLING, 
LIKE INDUSTRIAL PRODUCTION IN THE INDUSTRIALISED COUNTRIES, ABOUT EVERY FIFTEEN 
YEARS. | 
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- HOWVER, WE CAN AROUS THAT EVERY BOOK 18 A°GAEN FOR THE ADVANCEMENT OF 
KNOWLEDGE. NEVERTHELESS, IT 18 TRUE THAT THE ACCELERATION RATE OF BOOK 
PUBLICATION 18, IN FACT, PARALLEL 70 THE RATE at WHECH WAM DISCOVEEES NEY 
KNOWLEDGE. 


THE RESULT OF ALL THESE PUBLICATION ACTIVITIES 1s ‘THAT TAR INDIVIDUAL 
SCIENTIST IS STRUGGLING IN ANOCEAN OF INFORMATION AND IS ENDRAVOURING TO SEEK 
THE INFORMATION HE NERDS. FURTHERMORE, AU, SOUREALS AND BOOKS ARE PUBLISHED Tn 
DIFFERENT NATIONAL LANGUAGES, BECAUSE NO ONE COUNTRY CAN. MOROPOLIZE ™ 
DEVELOPMENT OF SCIENCE AND TECHNOLOGY. THEN, HOW CAN "HE SCIENTIST OF ONE 
COUNTRY BE FAMILIAR WITH THE SCIENTIFIC. AND TECHNOLOGICAL DEVELOPMENTS AND 
RESEARCH FINDINGS TAKING PLACE IN OTHER COUNTRIES ? SHOULD WE INSIST THAT 

THE SCIENTIST BE ABLE TO READ ONE, TWO, OR MORE FORRIGN LANGUAGES, SO HE 

CAN READ THE CURREND LITERATURE RELATING 70 HLS OWN FIELD, TO SEARCH "HE . 
LITERATURE ACCUMULATED IN THE PAST, AND TO BE MADE AWARE OF WORK RELATED To 
HIS FIELD 80 THAT HE CAN PROCEED CONFIDENTLY IN HTS WORK TO AVOID UNINTENTIONAL — 
DUPLICATION. 


THE SO-CALLED " INFORMATION EXPLOSION " HAS ARRIVED. SUITABLE WAYS MUST - 
BE ESTABLISHED TO ASSIST THE SCIENTIST AND RESEARCH WORKER TO READ CURRENT - 
LITERATURE IN THE FIELD RELATING TO HIS WORK, 10 SHARE KNOWLEDGE AMONG PEOPLE. 
AND AMONG NATIONS. 1 BELIEVE THAT MACHINE TRANSLATION OFFERS THE ONLY REALISTIC 
SOLUTION TO THE PROBLEM WE ARE FACING. A GREAT DEAL OF TIME AND RESOURCES COULD 
BE SAVED IF DUPLICATION OF RESEARCH EFFORT WERE TO BE ELIMINATED, OR AT LEAST 
REDUCED TO A MINIMUM BY MUTUAL REPORTING OF PROGRESS AND EXCHANGING IDEAS. ‘THE 
WORK OF SCIENTISTS SPEAKING DIFFERENT LANGUAGES CAN BE ACCELERATED BY LEARNING 
WHAT SCIENTISTS IN OTHER COUNTRIES ARE DOING, THEIR ACHIEVEMENTS AND FATLURES., 
AND THEY CAN PLAN A BETTER COURSE OF ACTION TO SOLVE THEIR OWN PROBLEMS. ‘THE 
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ONLY MEANS BY WHICH THESE SCIENTIST! CAN COMMUNICATE WITH EACH OTHER IS BY 
PUBLISHING PAPERS OUTLINING THE METHOD USED ‘AND. ‘TRE RESULTS. ‘OBTAINED, AND 
THESE PAPERS HAVE TO BE TRANSLATED BEPORE THEY BECOME UNDERSTANDABLE TO THE 
OTHER SCIENTISIS. AUTOMATIC TRANSLATION IS DEFINDTELY NEEDED SINCE HUMAN 
TRANSLATORS CANNOT POSSIBLY HANDLE THE VOLUME OF WORK OR EVEN HOPE TO ACQUTRE 
HE SPECIAL VOCABULARY TO MAKE GOOD TRANSLATIONS IN A WIDE RANGE OF TECHNICAL 
SUBJECTS. 


AUTOMATIC TRANSLATION CAN NOT BE PERFECT, WHETHER IT COULD EVEN BE RIGH 
QUALITY OR NOT IS DEPENDENT ON HOW HIGH THE STANDARDS ARE SET. THE IMMEDLATE 
GOAL IS NOT TO DESTGN A PERFECT AUTOMATIC TRANSLATION SYSTEM OR TO ACHIEVE HIGH 
QUALITY MACHINE TRANSLATION, BUT TO DESIGN A MACHINE TRANSLATION SYSTEM THAT IS 
BETTER AND MORE EFFICIENT THAN THE ONES WE HAVE TO-DAY. 
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“THE SUCQBSTION ‘THAT MODERN COMPUTING MACK era 8 Co BE.UBED.FOR. 
TRANSLATION ORIGINATED FROM A.D. BOOTH. | IN 196, VARIOUS MEW USES FOR 


AUTOMATIC ‘DIGITAL COMPUTERS WERE BETNG CONSTDENED, ‘THESE APPLICATIONS RANGED 
FROM THR NORE OBVIOUS APPLICATORS TO PROBLEMS, a MATHEMATICS AND PHYSTCS,, TO 
PHILOSOPHICAL PROBLEMS SUCH AS MECHANIZATION OF WUMAN THOUGHT PROCESSES, THE. 

_ PLAYING OF GAMES AND THE TRANSLATION OF LANGUAGE. AT THIS EARLY STAGE or 

AUTOMATIC TRANSLATION, MOST OF THE RESEARCH WORK DONE IN AMERICA WAS. ocusep 
ON GERMAN INTO ENGLISH, WILE IN RUSSIA THEY-FOCUSED ON GERMAN TWTO RUSSIAN. 
THIS WAS PROBABLY’ BECAUSE. GERMANY HAD DEVELOPED VERY ADVANCED ‘TECHNOLOGY DURING 
WORLD WAR ny HOWEVER, ATTENTION WAS soot DIRECTED TOWARDS RUSSIAN-ENGLISH | 
TRANBLATION mm AMERICA. WITH REGARDS TO THE AUTOMATIC TRANSLATTON OF CHINESE, 
RUSSIA HAD GURY ATO” OP QING AND RESOURODS OF WEIS KESPRCY BEFORE-THE 
AMERICAN ‘HEALTZED-T'ts IMPORTANCE. 


GENERALLY SPEAKING; MACHINE ‘TRANSLATION BAS PASSED THROUGH THRE’ PHASES 
_ OF DEVELOPMENT. "THE FIRS®, OR HRUEISTIC PERIOD, BEGAN WITH BOOTH'S. SUOGBETION 
| ser, DURING THIS PERTOD, BOOTH AND RICHENS PERFORMED EXPERIMENTS: OK 
: “MICRO-GLOSSARY DESIGN WHILST OTHER RESEARCH GROUPS STUDIED MACHINE PRODUCED 
GRAMMATICAL RULES 10 ASSIST IN TRANSLATTON. — 


THE SECOND PERIOD STARTED AROUND: 1950 WHEN OWING REIYLER ( WASEENOTON 
| ‘UNIVERSITY ) SUGGESTED A LINGUISTIC ANALYSIS AND: PROPOSED: SOME TRANSLATION 
PROCEDURES. SUGGESTIONS ON THE USE OF PRE-EDITORS AND POBT-EDITORS AND A 

a le? OF LINGUISTIC TRICKS 70 HELP IN BOLVING MACHINE GRANSLATION PROBLENE | 
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|» WERE PROPOSED. WITH SUBSTANTTAL GOVERNMENT suPPORt, Se ae ciertens: 
MADE PROGRESS ON A HUMBER OF FRORES SIMULTANEOUSLY, — “nevi, ee a 


SECOND PERIOD WAS A-TIME OF GREAT DISAPPOTNTMEND, ‘THERE WERE A LOT OF 
UNBOLVED PROBLEMS WHICH SEEMED UNLIKELY 10 BE SOLVED ‘ai ae iN 
om ACHIEVING ANYTHING CLOSE TO A PERFECT SEASELATION AD DORCESD ET ORNS 
es) MACHYNE-ALDED TRANSLATION, 


uw, 


is “A ryt 


_ IN THE THIRD PERIOD, FROM EARLY 1960'S TO i966, SOME ipo: PRoGiihee: was ¥ 
MADE: amp aE IMPORTANCE OF LINGUIB'TEC RESEARCH WAS’ ‘REALISED. MORE efore WAS 
DIRECTED TOWARDS RESEARCH INTO SYNTAX AND . SIMANEEES TH ORDER ‘TO. SSUABLIER MORE 
PRBCTER GRAMMATICAL ‘RULES. 


"© EN 1967, THE AUTOMATIC LANGUAGE PROCESSING ADVISORY CONBEITTER oem 
paapnse OF DR. ‘HARWORTH, DIRECTOR OF THE NATIONAL SCIENCE FOUNDATION, ae 7 
_ ANROUNCED: ITS FAMOUS REPORT RECOMMENDING ‘THAT RESEARCH SUPPORT ar Ae E 
| a ve COMPUTATIONAL LINGUISTICS, BUT NOT FOR MACHINE ‘TRANSLATION WaICH, ms onern. : 
VIEW, WOULD HAVE NO POSSIBILITY OF TANGIBLE SUCCESS IN THE KEAR FUTURE. 
ALL RESEARCH IN MACHINE TRANSLATION WAS THUS TERMINATED . 


‘ 3 
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THE MACHINE TRANSLATION RESKARCL AT THE CEINESE UNIVERSITY OF HONG KONG 
BEGAN IN 1969, WE CAREFULLY STUDIMD AND REVIEWED RESEARCH REPORTS CONCERNING 
MACHINE TRANSLATION. THE ACCOMPLISHIENTS AND FAILURES OF PREVIOUS WORKS WERE 
EXAMINED. A NEW APPROACH 10 AUTOMATIC TRANSLATION OF LANGUAGE BY COMPUTER 
WAS PLANNED. 


WE RECOGNISED AT THE OUYSEL. ‘THAT FULLY AUTOMATIC HIGH QUALITY TRANSLATION 
(FAHQT) IS NOT FEASIBLE. A GREAT DEAL OF RESEARCH WORK, BOTH IN ‘THEORETICAL 
LINGUISTICS AND COMPUTER PROGRAMMING HAS TO BE DEVSLOPRD BEFORE THR PROBLING OF 
SPEED, ACCURACY » READABIT.ITY, QUALTTY AND ECONOMIC FACTORS OF AUTOMATIC 
TRANSLATION CAN Hi SOLVED. ACCORDING 10 BAR-HILLEL, “ HIGH QUALITY HAS 70 BE 
RELATIONIZED WITH REGARD TO USERS AND SITUATIONS. THOSE WHO ARE TATERESTED IN 
_ MACHINE TRANSLATION PRIMARILY AS PRACTICAL DEVICE MUST REALISE THAT FULL 
AUTOMATION OF TRANSLATION PROCESSES I8 INCOMPARTSLE WITH HICH QUALITY. THERE 
ARE TWO POSSIBLE DIRECTIONS IN WHTCH A COMPROMIS 3& COULD BE REACHED: OME 
COULD SACRIFICE QUALITY OR ONE COULD REDUCE THE SRLP-SUFFICIENCY OF THE 
MACHINE OUTPUT. " FAHY MUST THEREFORE BE SACRIFICED FOR THE TIME BEING, AND 


EFFORT SHOULD BE FOCUSED MORE ON A MACHINE-MAN PARTNERSHIP FOR TRANSLATION. 


RESEARCH INTO MACHINE TRANSLATION aT TH: CHINESE UNIVERSITY TAKES A 
DIFFERENT APPROACH ‘THAN “HE OTHERS IN THAT THE CHINESE UNIVERSITY OF BONG KONG 
PLACES A HRAVY FMPHASTS ON PRE-EDTTING THE SOURCE TENT tHSTEAD OF POST-EDITING 
THE TARGET TEXT. Ji TS THR ONLY GROUP TAKING THIS APPROACH OF COMPUTER-PRE~ 


EDITOR. PARUNERSHIY. ALL THE OTHER GROUPS, WHO REALISED THE FAHQT IS NOT REALLY 
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ATTAINABLE IN THE NEAR FUTURE AND SO A LESS sNiTrrous ath Is DSPINITELY 


INDICATE, HAD ADOFTED A TICTENCY DO COMENGLUGST IN FURGILG SOME COMFUTER-POST- 

EDITOR PARTNERSHIP. a 
ACCORDING TO BAR-HILLER AGAIN, "WHR ONLY ResSOMARLE AIM, THEN, FOR 

SHOTT-TERM RESEARCH INTO MI SiinS 10 BE THAT OF FINDING SOME MACHINE~POST-EDITOR 
PARTNERSHIP THAT WOULD BE COMMERCIALLY VIABLE WITH EXISTING HUMAN TRANSLATION 
AND THEN TO TRY TO IMPROVE THY COMMMRCTAL ETFECTIVENESS OF THIS PARTNERSHIP — 
BY IMPROVING THE PROGRAMMING IN ORDER TO DELEGATE TO THE. COMPUTER MORE AND NORE 
OPERATIONS IN THE TOTAL TRANSLATION PHOCESS WHICH TT CAN PERFORM MORE » 
"EFFECTIVELY THAN THE FOST-EDITOR. THESE IMPROVEMENTS WILL, OF COURSE, 

UNILIZE NOT OWLY DEVELOPMENTS IN HARDUARE, PROCRAMMING AND LINGUISTIC ANALYSIS, 
BUT ALSO THE EXPERIENCE GAINED BY ANALYSING THE MACHINE OUTPUT ITSELF ". THE 
RESEARCHERS AT CURK FEEL STRONGLY THAT IF " POST-EDITORS " MENTIONED ABOVE ARE 
_-GHANGED 10" PRE-EDTTORS ", THEN VHE SEMANTIC AND LEXICAL ANALYSIS WILL BE 

MICH SIMPLER AND EFFEGTIVE. IN THEORY, .IF WE CAN PRE-EDIT THE SORCE LANGUAGE ‘TO 
“MAKE IT AS COMPATIBLE TO THE STRUCTURE OF THE TARGH? LANGUAGE AS POSSTELE AND 
(HEN PROGRAM A SEP OF RULES FOR ANALYSIS AND SYUTRESIS, THEN THE TRANSLATED 
TEXTS OBTAINED WILL BE READADLE AND ACCURATE, TO QUOTE A.G. OFTTINGER : " 
BEFORE MACHINE INSTRUCTIONS FOR TRANSLATION CAll BE FORMULATED, THE SIGNIFICANT 
PATTERES OF THE SOURCE LANGUAGE AND TARGET LANGUAGE MUST BE PRECIGELY 
IDENTIFIED AND PUT INTO CORRESPONDENCE .... CORRESPONDING PATTERNS, THEREFORE, 
MUST DEFINED AS CONVEYORS OF EQUIVALENT MEANINGS SINCE, WHATEVER MRANING 8, 


If IS GENERALLY AGREED THAT IT MUST BE PRESERVED IN TRANSLATION. " 


A FIXED SET OF PRE-EDITING. RULES MUST THEREFORE BE FORMULATED TO ENABLE 

INEXPERIENCED AND EVEN MONO-LINGUAL PEOPLE TO TRANSFORM QUICKLY THE Input INTO 
-MACHINE~TRANSLATABLE FORM. WITH THIS ARRANGEMENT, POST<EDITING CAN BE KEPT TO 
A MINIMUM, IF NOT ALL TOGETHER ELIMINATED. GIVEN TIME AND BETTER PROGRAMMING 


heb AN A be 
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TECHNIQUES, THESE PRE-EDITING RULES WILL GRADUALLY<BE REDUCED 80 THAT THE 
COMPUTER WILL EVENTUALLY TAKE UP THIS ROUPINE WORK. PRE-EDIPING - CAN; DHEREFORE 
SOLVE MANY OF THE PRESENT LINGUISTIC PROBLEMS THAT ARE OTHERWISE DEPENDENT 
ON FURTHER RESEARCH IN NATURAL LANGUAGE, COMPUTATIONAL LINGUISTICS. AND: 
TRANSFORMATION MATHEMATICS. IN OTHER WORDS, MODELS THAT ARE MUCH MORE 
COMPREHENSIVE AND SOPHISTICATED THAN THE PRESENT ORRS HAVE TO BE DESIGNED. = 
THESE MODELS MAY TAKE YEARS TO PERFECT AND, Av PRESENT , Pre-eorenta 18 
ABSOLUTELY ESSENTIAL IN,ORDER TO ACHIEVE THE GOAL, IN PRESEN? STAGE OF-OUR 
DEVELOPMENT, VERY COMPLEX SERTENCES CAN BE TRANSLATED WITH THE ALD OF PRE 
"EDITING. A SENTENCE WHICH EAS A COMPLICATED STRUCTURE CAN BE. ANALYSED BY 
THE EXISTING PROGRAM IF IT 1S BROKEN UP INTO SIMPLER SENTEECRS WHICH ARE THEN 
READILY TRANSLATED BY THE COMPUTER. SENTENCES IN CHINESE ARE OPTEN WITHOUT 
VERBS OR SUBJECTS AND PRE-EDITING CAN ADD THE VERBS OR SUBJECTS SO THAT 
THESE SENTENCES CAN THUS BE ANALYSED AND TRANSLATED. — | 


CULT ( CHINESE UNIVERSITY LANGUAGE TRANSLATOR ) WAS DEVELOPED BASED on THE 
PRINCIPLE MENTIONED ABOVE AND HAS BEEN RIGOROUSLY EXAMINED AND TESTED. THE 
TRANSLATION ALGORITHM IS GIVEN IN FIG. 1. SINCE THE BEGINNING OF THIS YEAR, 
THE CULT SYSTEM HAS BEEN-USED ON A REGULAR BASIS TO TRANSLATED TWO CHINESE 
SCIENTIFIC JOURNALS, ACTA MATHEMATICA SINICA AND ACTA PHYSICA SINICA, WHICH 
ARE PUBLISHED BY THE PEKING ACADEMY OF SCIENCE, THIS ACCOMPLISHMENT 


INDICATE THE CORRECTNESS OF OUR APPROACH AND THE POTENITAL CAPABILITY OF CULT. | 
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_ TRANSLATION ALGORITHM 
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FIG. 1 a 


er oseal 


Approved For Release 2008/03/03 : CIA-RDP83M00171R001800120014-0 


Approved For Release 2008/03/03 : CIA-RDP83M00171R001800120014-0 


FUTURE 


IN THE NEXT FIVE TO TEN YEARS, DEVICES MIGHT BE AVAILABLE AT A REASONABLE 
COST FOR INPUTTING AND OUTPUTTING CHINESE OR OTHER NOQN-ALPHEBETIC CHARACTERS, 
THIS 1S, AT PRESENT, THE PRINCIPAL PROBLEM FACING THE PROCESSING OF LANGUAGES 
OTHER THAN THOSE USING ALPHABETS, 


MORE LINGUISTIC RESEARCH APPLICADLE TQ MACHINE TRANSLATION MAY YIELD 
USEFUL RESULTS SO THAT MORE COMPREHENSIVE GRAMMATICAL RULES AND SENTENCE 
ge 
STRUCTURES BETWEEN LANGUAGES CAN,FORMULATED. 


“ 


AT THE CHINESE UNIVERSITY, WE ARE AT THE PRESENT WORKING ON A LANGUAGE 
TRANSLATOR WHICH WILL ENABLE US TO TRANSLATE CHINESE INTO ENGILISH AS WELL AS 
TO TRANSLATE ENGLISH INTO CHINESE, A SIMULATION STUDY HAS BEEN DONE WHICH 
INDICATES THE POSSIBILITY OF SUCCESS, WE ARE HOPING THAT, IN A FEW MONTHS, A 
MODEL MAY BE DESIGNED To DEMONSTRATE THE SYSTEM. IF {T ¥S SUCCESSFUL, WE 
ARE OF THE OPINION THAT A UNIVERSAL TRANSLATOR MAY BE DESIGNED SO THAT ONLY 
ONE TRANSLATOR WILL BE. REQUIRED TO. DO TRANSLATION BETWEEN LANGUAGES, 


ASK NOT WHAT THE COMPUTER CAN DO FOR YOU. ASK WHAT YOU, WITH THE AID 
OF THE COMPUTER, CAN DO FOR THE BETTERMENT OF THE. MANKIND. 


ee 
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